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Abstract 


In some long term studies, a series of dependent and possibly truncated life- 
times may be observed. Suppose that the lifetimes have a common marginal 
distribution function. In left-truncation model, one observes data (X;, T;) 
only, when T; < X;. Under some regularity conditions, we provide a strong 
representation of the Bn estimator of 6 = P(T; < X;), in the form of an 
average of random variables plus a remainder term. This representation en- 


ables us to obtain the asymptotic normality for Be 
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1 Introduction 


In medical follow-up or in engineering life testing studies, one may not be able 
to observe the variable of interest, referred to hereafter as the lifetime. Among 
the different forms in which incomplete data appears, right censoring and left- 


truncation are two common ones. Left truncation may occur if the time origin of 
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the lifetime precedes the time origin of the study. Only the subjects which are 
failed after the start of the study are followed, otherwise they are left truncated. 
Woodroofe [9] reviews examples from astronomy and economy where such data 
may occur. 

Let X1, X2,... be a sequence of the lifetime variables which may not be mutu- 
ally independent, but have a common continuous marginal distribution function 
F. Let T,,T2,... be a sequence of independent and identically distributed ran- 
dom variables with continuous distribution function G. They are also assumed 
to be independent of the random variables X;,’s. In the left-truncation model, 
(X;, Tj) is observed when T; < X;. Let (X1,71),...,(Xn,TIn) be only the sample 
one observes (i.e., T; < X;), and 8 > 0, where 


B= P( <X,) = if ” G(s)dF(s), (1) 


is the truncation probability (TP). 

Assume, without loss of generality, that X; and 7;,7 = 1,...,n, are non- 
negative random variables. For any distribution function H, we denote the 
left and right endpoints of its support by ag = inf{z : H(z) > 0} and bg = 
sup{z : H(z) < 1}, respectively. Then under the current model, as discussed 
by Woodroofe [9], we assume that ag < ap and bg < br. Equation (1) suggests 
estimating (6 by 

bn = [ Guls)aFn(s), (2) 
—oo 
provided good estimates F;, and G,, for F and G can be obtained. 

For the case in which the lifetime observations are mutually independent, 
Woodroofe [9] proved that if Ff, and G, are product-limit estimates (given by 
(4) below), 6,, converges in probability to 6 as n > oo. Under similar conditions 
as in Woodroofe [9], the asymptotic normality of /n(6,, — 6) has been investi- 
gated by several authors using different methods. Chao [7] used influence curves 
and Keiding and Gill [6] used finite Markov processes and the well-known delta 
method. Since F, and G, have complicated product-limit forms, the properties 


of 8, is generally not easy to study. Let J(A) denotes the indicator function of 
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the event A. He and Yang [4] proposed, instead, another estimate of 3 as 
= _ Gn(z)(1— Fa(a)) 
n C, (x) b) 


for all « for which C,,(x) > 0, where 


Caan SI eae x), 
w=1 


is the empirical distribution of 
O(2) = P(T, <2 < X1|T, < X1) =6 "(1 - F(x))G(2). 


Using Bn, He and Yang [4] proved the almost sure convergence of the estimate 
of @ and obtained a manageable i.i.d. representation for Bn hence the asymptotic 
normality of the estimate. 

Our basic aim in this article is to express the TP estimator Bn as an av- 
erage of a sequence of bounded random variables plus a remainder of order 
O(n-'/2(logn)~°) for some 6 > 0, for the case in which the underlying life- 
times are assumed to be a-mixing whose definition is given below. As a result, 
the asymptotic normality of TP estimator is obtained. 

Let F denote the o-field of events generated by {Yj;i < j < k}. For easy 
reference, let us recall the following definition. 

Definition. Let {Y;,i > 1} denote a sequence of random variables. Given a 


positive integer n, set 


a(n) = sup{|P(AN B) ~ P(A)P(B)| Ae FRB EFS}. (3) 


The sequence is said to be a-mixing (strongly mixing) if the mixing coefficient 
a(n) + 0 as n > oo. Among various mixing conditions used in the literature, 
a-mixing is reasonably weak and has many practical applications (see, e.g. [1] 
for more details). In particular, the stationary autoregressive-moving average 
(ARMA) processes, which are widely applied in time series analysis, are a-mixing 


with exponential mixing coefficient, i.e., a(n) =e-””, for some v > 0. 
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The rest of the present paper is organized as follows. In Section 2, we provide 
the strong representation results for the TP estimator. The proofs are given in 


Section 3. 


2 Strong representation for the TP estimator 


We first introduce some notation before stating the strong representation result. 


The random truncation model is defined by the joint distribution 


H (x,t) = P(X, < x, T; < t|Ty < X}) 


with marginal distributions, 
F* (x) = P(X, < a|T; < Xi) = of G(s)dF(s), 
0 
and 
G*(a) = P(T, <alT, < Xy) = aa (1 — F(s))dG(s). 
0 


Let Fy and G7} be the empirical distributions of F* and G* defined by 
PG) = ns Ie a): cand: Gi) Sr Ty aT a, 
The well-known product-limit (PL) estimates of F;, and G,, are defined by 


es 1 ~ 1 
F,(a) =1— (USCA 1 Gn = aa 4) 
For construction of these estimates, see [9] or [7]. Suppose 
dF*(s) dG*(s) 
CX(s) < 0O and OPsy < oo. (5) 
Le 
I(x < y) BANG SOG) a5 
vi (2, t, y) _ O(a) ai C(s) dF (s), 
and 
vo(a,t,y) = oe =! | aes) 
y 


Then, Fy (Xi, Ti, y) = Ewe(Xi, Ti, y) = 0, and 


yi Ay2 dF*(s) 


Covu(pr(Xi, Ti, yr), Wi (Xi, Ti, y2)) = | “C2(s) ’ 


ap 
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and 
bax dG*(s) 
C?(s) - 


Cov(io(Xj,T:, 91), b2(Xi, Ts, yo) = y 


yiAy2 
The following theorem provides the strong representation for Bn 
Theorem 2.1. Suppose that {X,,;n > 1} is a sequence of stationary a-mixing 


random variables with a(n) = O(n~”), for some v > 3. If ag < ap, then 
ut ie 
Bn — B = —B— S| W(Xi, Ti) + Rn(y), (6) 
i=l 
is uniformly in 0 < y <6 < bp, where 


sup |Rn(y)| = O(n-!/*(logn)~°) as. 
O<y<b 


for some 6 > 0 depending only on v. We next present the asymptotic normality 
of the TP estimator based on our strong representation result. 
Theorem 2.2. Under the assumptions of Theorem 2.1, if ag < ap, then for 


O<y<b< br, 


~ 


Vni(Bn — 8) N(0,0?), (7) 


where 


ar a. B?{Var(b(X1, Th, y)) +2 > Cov((X1,T1,y), W(X, Ti, y))}- 
i=2 


3 Proofs 


In order to prove Theorem 2.1, we need the following lemma which is Theorem 
2.1 in Sun and Zhou [8]. Note that the proof of (9) is similar to that of (8) and 
is therefore omitted. 

Lemma 3.1. Suppose that {X,;n > 1} is a sequence of a-mixing random 


variables with a(n) = O(n”), for some v > 3. If ag < ap, then 


Puly) = Fly) +1 - FW) = 2 e(XeTy) + Ray) as. 8) 
i=1 
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and 
be ia 
Grly) = G(y) — Gy) D(X, Ti.) + Rnaly) a8, (9) 
i=1 
uniformly in 0 < y < b < bp, where 
sup |Rni(y)| = O(n '/?(logn)~°) as. 
0<y<b 
and 


sup |Rna(y)| = O(n /?(logn)°) — a.s., 
0<ySb 


for some 6 > 0 depending only on v. 
Proof of Theorem 2.1. Using Lemma 3.1, for 0 < y < b < bp, with 
probability 1 for large n, we have 
Benet Gn(y)A — Fry) — Gy) = Fly) 
Cn(y) Cy) 
(1 — F(y))C(y) 


= (1 = F(y)CWGty) ie a | =! n = 
= Gee) Cn Lats) — 7D ke Ty) 


n 


7 ra DWT <9 < Xi) — Cy]} + O(n" logn)™) 
i=1 


1 n 
= By Dv MoT + Of Mog) 9) as. 


Meeeee ae (Xe Tet) + a(Xi Tw) + GUE Sv S Xi) — CW) 
ee eee gee 
= 2OOGi Eds O%(s) 


It is easy to see from Lemma 1 of Cai [1] that {Y(Xi,Ti,y);Ti < Xi, i = 


dF*(s)—1 as. 


1,2,...} is a sequence of stationary a-mixing bounded random variables. The 
random variable 7)(X;, T;, y) does not depend on y, therefore, the proof of Theo- 
rem 2.1 is complete. 

Proof of Theorem 2.2. An application of Theorem 18.5.4 of Ibragimov and 
Linnik Yu [5] and Theorem 2.1 gives (2.4). It can be easily checked that 


x *(g bg« *(6 
Var(p(X1,Ti,y) = ae | Seay teen 


ap 


Asymptotic normality of the truncation probability ... 59 


which is finite under (5). On the other hand, a(n) = O(n-”),v > 3 implies 
>> a(n) < co and therefore 0°, Cov(#(X1,T1,y), P(X: Ti, y)) < co. So, o? isa 


positive finite number and the proof of Theorem 2.2 is complete. 
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